(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "Primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CSGGSGSSGC SGGSTTCATS GG
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

GGGWVCTGGY VSGGSCCGTA GTTG 24

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 546 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

CCCGGGCGGG GCGGGGTTCA TCGGCTCCGC CTACGTCCGC CGGCTCCTGT CGCCCGGGGC 60

CCCCGGCGGC GTCGCGGTGA CCGTCCTCGA CAAACTCACC TACGCCGGCA GCCTCGCCCG 120

CCTGCACGCG GTGCGTGACC ATCCCGGCCT CACGTTCGTC CAGGGCGACG TGTGCGACAC 180

CGCGCTCGTC GACACGCTGG CCGCGCGGCA CGACGACATC GTGCACTTCG CGGCCGAGTC 240

GCACGTCGAC CGCTCCATCA CCGACAGCGG TGCCTTCACC CGCACCAACG TGCTGGGCAC 300

CCAGGTCCTG CTCGACGCCG CGCTCCGCCA CGGTGTGCGC ACCCTCGTGC ACGTCTCCAC 360

CGACGAGGTG TACGGCTCCC TCCCGCACGG GGCCGCCGCG GAGAGCGACC CCCTGCTCCC 420

GACCTCGCCG TACGCGGCGT CGAAGGCGGC CTCGGACCTC ATGGCGCTCG CCCACCACCG 480

CACCCACGGC CTGGACGTCC GGGTGACCCG CTGTTCGAAC AACTACGGCC CGCACCAGTT 540

CCCGGG 546



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 541 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

CCCCGGGTGC TGGTAGGGGC CGTAGTTGTT GGAGCAGCGG GTGATGCGCA CGTCCAGGCC 60 

GTGGCTGACG TGCATGGCCA GCGCGAGCAG GTCGCCCGAC GCCTTGGAGG TGGCATAGGG 120

GCTGTTGGGG CGCAGCGGCT CGTCCTCCGT CCACGACCCC GTCTCCAGCG AGCCGTAGAC 180 

CTCGTCGGTG GACACCTGCA CGAAGGGGGC CACGCCGTGC CGCAGGGCCG CGTCGAGGAG 240

TGTCTGCGTG CCGCCGGCGT TGGTCCGCAC GAACGCGGCG GCATCGAGCA GCGAGCGGTC 300 

CACGTGCGAC TCGGCGGCGA GGTGCACGAC CTGGTCCTGG CCGGCCATGA CCCGGTCGAC 360

CAGGTCCGCG TCGCAGATGT CGCCGTGGAC GAAGCGCAGC CGGGGGTGGT CGCGGACCGG 420 

GTCGAGGTTG GCGAGGTTGC CGGCGTAGCT CAGGGCGTCG AGCACGGTGA CGACGGCGTC 480

GGGCGGCCCG TCCGGACCGA GGAGGGTGCG GACGTAGTGC GAGCCCATGA ACCCCGCCGC 540

C 541 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Ala Ala Gly Phe Met Gly Ser His Tyr Val Arg Thr Leu Leu Gly Pro
1 5 10 15

Asp Gly Pro Pro Asp Ala Val Val Thr Val Leu Asp Ala Leu Ser Tyr
20 25 30

Ala Gly Asn Leu Ala Asn Leu Asp Pro Val Arg Asp His Pro Arg Leu
35 40 45

Arg Phe Val His Gly Asp Ile Cys Asp Ala Asp Leu Val Asp Arg Val
50 55 60

Met Ala Gly Gln Asp Gln Val Val His Leu Ala Ala Glu Ser His Val
65 70 75 80

Asp Arg Ser Leu Leu Asp Ala Ala Ala Phe Val Arg Thr Asn Ala Gly
85 90 95

Gly Thr Gln Thr Leu Leu Asp Ala Ala Leu Arg His Gly Val Ala Pro
100 105 110

Phe Val Gln Val Ser Thr Asp Glu Val Tyr Gly Ser Leu Glu Thr Gly
115 120 125

Ser Trp Thr Glu Asp Glu Pro Leu Arg Pro Asn Ser Pro Tyr Ala Thr
130 135 140

Ser Lys Ala Ser Gly Asp Leu Leu Ala Leu Ala Met His Val Ser His
145 150 155 160

Gly Leu Asp Val Arg Ile Thr Arg Cys Ser Asn Asn Tyr Gly Pro Tyr
165 170 175

Gln His Pro Gly
180

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 181 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Pro Gly Gly Ala Gly Phe Ile Gly Ser Ala Tyr Val Arg Arg Leu Leu
1 5 10 15

Ser Pro Gly Ala Pro Gly Gly Val Ala Val Thr Val Leu Asp Lys Leu
20 25 30

Thr Tyr Ala Gly Ser Leu Ala Arg Leu His Ala Val Arg Asp His Pro
35 40 45

Gly Leu Thr Phe Val Gln Gly Asp Val Cys Asp Thr Ala Leu Val Asp
50 55 60

Thr Leu Ala Ala Arg His Asp Asp Ile Val His Phe Ala Ala Glu Ser
65 70 75 80

His Val Asp Arg Ser Ile Thr Asp Ser Gly Ala Phe Thr Arg Thr Asn
85 90 95

Val Leu Gly Thr Gln Val Leu Leu Asp Ala Ala Leu Arg His Gly Val
100 105 110

Arg Thr Leu Val His Val Ser Thr Asp Glu Val Tyr Gly Ser Leu Pro
115 120 125

His Gly Ala Ala Ala Glu Ser Asp Pro Leu Leu Pro Thr Ser Pro Tyr
130 135 140

Ala Ala Ser Lys Ala Ala Ser Asp Leu Met Ala Leu Ala His His Arg
145 150 155 160

Thr His Gly Leu Asp Val Arg Val Thr Arg Cys Ser Asn Asn Tyr Gly
165 170 175

Pro His Gln Phe Pro
180
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6854 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
CTGCAGGGTT CCGTGGTGOA CGAGGCGCCO CTGGTCGACG ACCAGGGCGC TGTOGCAGAT 60 

prprr/"^T'p,- rn,-, - a Ti 'T> i Taif-^-'i-MTirrT r; \*zc J^jr C ?\C '-Z CTGOTGCCGA GTT' "CCGGTG 12 0 

ggcgcggttg AOGAGCCGGC GGAGGGGGTG CTTCAGCACC ATGTGGAGGC CGATGGTGGG 180 

CTCGTCCCAG AAGAGCACGG OCGGGTCGTG CAGCAGGCTC GCCGCGATCT CGGCGCGCAT 240

GCGCTGTCOG AGGGTGAGGT GGGGGACGGG GGTGGAGCGG AGCGCGTCGA TGTGGAGGAG 300 

gtggcggaac agggogaggt tgggggggta gaggggtggg gggatgtcgt agatgcggcg 360

oaggatgggg aaggagtcgg gtagggacag gtccgagcag aggtggctgg ggtgggcgaa 420 

gaggaggogg atggtggggg ggttgcgctg gcggtggggg TAGGGGTGCA GGCGGGCGAO 480

cgtgcagcgg c- :ggaggtgg gggtgatgat gcgggtgagg atcttgatcg tggtcgactt 540

ggggggtggg ttgggggcga tgtaggoggt cttcgtgcgg gccggtatct cgaaggagag 600 

gtggtggagg ggggggagga gggggtaccg GGGGGTCAGG AGGGTGGAGA GGCTGCCGAG 660 

GAGGCGGGGG tggggttggg GGAGGCGGAA ctccttgacg AGGTGTTCGG GGAGGATCAG 720

gggatgacgg ggtggagggc ggtgtcgagg aggcgcaggg ggtggtcgag gagggcgtgg 780 

tcgagggtga aggggggtgc gaggoggagg atgtggccgc gcagggaggt gcggagccgg 840

AGGTGGAGGG OGGTGGTGTA gacggcccgg ggggtgtggg gggggggtgg GGGGGGGAGG 900

GCGTCGGTGA cgaagtcgag gggggagagg agtccgaggc CGCGTACCTG GGGGAGGTGG 960 

GGGAAGGGGG AGTOGAGGGG GGGCAGCCGC TOGTGGATGA GGTGGGGGAG GAOGGGGAGG 1020

CGGTGGATGA GGGGGTGGOG GTGGAGGAGG TOGAGCGTGG CGCGGGCGGG GGCGATGCCG 1080 

AGTGGGTTGC TOGOGTAGGT CGAGGCGTAG GGCCCGGGGT GGCCGCCTCC GGGGTGGGGA 1140 

GCTTOOGGGG GT' "GG« ''"AG CAGGGGGAAG GGGAATGGGC TOGGGGTGOG GTTGGACAGG 1200 

ATGGGGAGGT G< ;GGGTOGAT GCCGAACAGT TGGCTGGCGA GGAAGGCGCC GGTGCGCCCG 1260

CGGGGGGTGA GGACCTGGTV 'G'GA' -..AG. ' AGGAOGOOGO OGTGGGGGGA GG GGGC GGGG 1320 

ATCCGCTGCC AGTAGO' 3GGG GGGGGGGAGG ATGAGGGGTG GGGGGGGGAG GACGGGTTCG 1380 

AAGAGCAGGG GGGAGAGGTT GGGCTTCTCC GGGATGTGGG GGGGGAGGAG GGTGGGGGAG 1440 







CGCACGTCGC ACGAGGGGTA CTCCAGGCCC AGGGGACAGC GGTAGCCAGT AGGGGCTGTA 1500 

GCCAGCACGC TGTTGCCGCT GAAGGCCTGG TGGCCGATGT CCCAGTGGAC CAGCATCCGG 1560

GCGCCCATGG TCTTGCCGTG GAAGCCGTGG CGCAGGGCGC AGATCCGGTT GCGGCCCGGC 1620

GCGGCGGTCG CCTGGACGAC CCGCAGGGCG GCCTCGACCA CCTCCGCGCC GGTGGAGAAG 1680

aaggcgtagg tgtcgagctg ttcgggcagc agcctggcga gcagttccag caggccggcg 1740

cggtccggcg tggcgctgtc gtggacgttc cacaggcggc gggcctgggt ggtgagtgo; 1800

TCGACGACCT CCGGGTGCCC GTGGCCCAGT GACTGGGTGA GGGTCCCGGC CGCGAAGTCG 1860

AGGTACTGGT TGCCGTCCAG GTCGGTCAGA ACGGGACCGC gtccctcggc GAAGACCCGG 1920

cgtccgtgga cggcttcctc ggaggcgccc ggcgccaggt ggcgggcctc ccgtgccagg 1980

TGCTGTGTCT GCCGTAAGCC TGTCATCGCT GCCTCTGCTC 1 iTCGGACCGG CTGACGCGAT 2040

cgccggcgaa ctgcgttgtg gcgcaccacg gttggggcgg '.:tcggi:;gctg agtcaaacac 2100

ttgaacacac accgctgcaa gagtttgcgg gttgtttcag aa^gttgttg cgagcggccc 2160

CGGCACTCTG GTTGAGTCGA CGTGCTTACG GCGCCACCAC GCCTCACGTT CGAGGAGGGA 2220

CCTGTGAGAA CAAGCCCGCA gaccgacccg CTCCCGCGGA GGCCGAGGTG AAGGCCCTGG 2280

TCCTGGCAGG TGGAACCGGC AGCAGACTGA GGCCGTTCAC GCACACCGCC GCCAAGCAGG 2340

TGCTCCCCAT CGCCAACAAG CCCGTGCTCT TCTACGCGCT GGAGTCCCTC GCCGCGGCGG 2400

GTGTCCGGGA GGCCGGCGTC GTCGTGGG :G CGTACGGCCG GGAGATCCGC GAACTCACCG 2460

gcgacggcac cgcgttcggg ttacgcatca cctacctcca ccagccccgc ccgctcggtc 2520

TCGCGCACGC GGTGCGCATC GCCCGCGGCT TCCTGGGCGA CGACGACTTC CTGCTGTACC 2580

TGGGGGACAA CTACCTGCCC CAGGGCGTCA CCGACTTCGC CCGCCAATCG GCCGCCGATC 2640

CCGCGGCGGG CCGGCTGCTG CTCACCCCGG TCGCGGACCC GTCCGCGTTG GGGGTGGCGG 2700

AGGTCGACGC GGACGGGAAC GTGCTGCGGT TGGAGGAGAA ACCCGACGTC CCGCGCAGCT 2760

CGCTCGCGOT CATCGGCGTG TACGCCTTCA GCCCGGCCGT CCACGAGGCG GTACGGGCCA 2820

TCACCCCCTC CGCCCGCGGC GAGCTGGAGA TCACCCACGC CGTGCAGTGG ATGATCGAGC 2880

ggggcctgcg cgtacgggcc gagaccacca cccggccctg gcgcgacacc ggcagcgcgg 2940

aggacatgct ggaggtgaag cgtcacgtcc tggacggagt ggagggccgc atcgagggga 3000

AGGTCGACGC GCACAGCACG CTGGTCGGCO GGGTCCGGGT GGCCGAAGGC GCGATCGTGC 3060

GGGGGTCACA CGTGGTGGGC CCGGTGGTGA TCGGCGCGGG TGCCGTCGTC AGCAACTCCA 3120

GTGTCGGCCC GTACACCTCC ATCGGGGAGG A'"TGCCGGGT CGAGGACAGC GCCATCGAGT 3180

ACTCCGTCCT GCTGCGCGGC GCCCAGGTCG AGGGGGCGTC CCGCATCGA G 2GT-2- "F- 'A 3240

T< '',',(•■: ; <••;;;./< -gt- "CP" ggcccggcvv v vcgtctccc gcaggct-. 'Ac cgactggtga 3300

tcggcgacca cagcaaggtg tatctcaccc catgaccacg accatcctcg TCACCGGCGG 3360



AGCGGGCTTO 
CGTCGCGGTG 
GGTGCGTGA 1 ' 
CGACACGCTG 
CCGCTCCATC 
GCTCGACGC* : 
GTACGGCTCC 
GTACGCGGCG 
CCTGGACGTC 
GCTCATACCG 
CGGGCGGCAC 
CCGCGTGTCG 
CCTGGAGCTC 
CGTCGAGAAC 
GGAACTCGGT 
GTACGAGCGG 
ACCGCAACCA 
CGTCCGTGAG 
AGTTCOGGCA 
AGTGCGTCCA 
GGGT CGGTGA 
ACCGTGAGAC 
CACATGACGG 
AGGGAGAACA 

tcccgcacgc 
ccgacggcga 
atcgcgg< :<:c 
acgaggacga 
gtgtccgcgt 
tccgtgagtt 
ccctgao:_-.cg 
agttcggggg 



ATTCGCTCCG 
A<"CGTCCTCG 
CATCCCGGCC 
GCCGCGCGGC 
ACCGACAGCG 

gcgctccgoc 
ctcccgcacg 

TCGAAGGCGG 
CGGGTGACCC 
CGCTTCCTGA 

gtgcgcgact 
ggccggccgg 
acgcaccggt 
ggcaaggggc 
taccggccgc 
cacgaggact 

CCGGCOCCGG 
CCGGGCGCCG 
TGAAGACGAA 
CGGCGACCTG 
AGGCC ATGAG 
CCCGCCGGCG 

cggtgcagcc 
gcgagtgctg 

CGGCOCGGAA 
' SCCGA' "GCG 
GGTC< iT<;;;GG 
AGGG 1 :ACCTG 

gcagt ccgga 
cgtcctcggt 
igagcgcctg 
tgaccagtcc 



CCTACGTCCG 
ACAAACTCAC 
TCACCTTCGT 
AC G AC GAG AT 
GTGCCTTCAC 
ACG<;TGTGC^ 
GGGCCGCCGC 
CCTCGGACCT 
GCTGTTC GAA 
CCAGCCTCCT 
GGCTGCACGT 
GAGAGATCTA 
TGCT CGCACT 
ACGACCGG< 'G 
GCACCGACTT 
GGTGGCGTCC 
CCGGCACA 
GCCGCCTCgC 

ttcggtgcgc 
cco/atcgcc 
cggcgagtgg 
ggcggcccgc 

CAGGTCGATC 
CACGAGCTCC 
CCCCTCGATC 
GTG'.JTG 1 V' V 
G GAG AC 1 iAAG 

GAG GAA G AT G 



CCGGCTCCTG 
CTACGCCGGC 
CCAGGGGGAC 
CGTGCACTTC 
CCGCACCAAC 

cacgttcgtg 

G GAG AGO" GAG 

catggcgctc 
caacttoggo 

GTCCGGCGGC 

cgacgaccac 
caacatcggg 

GTGCGGCGCG 

cta:g':;ggtg 

C GCG AC' ""GCG 
CCTGCT'"GCC 



TCGCCCGGGG 
AGi'CTCGCCC 
GTGTGCGACA 
GCGGCCGAGT 
GTGCTGGGCA 
CA' *GTOT'"CA 
CCCCTGCTTC 
GCOCACCACC 
CCCCACCAGC 
ACCGTTCCCC 
GTOAGGGCCG 
GGCGGCA''"GT 
GG : "C< "'..-GAGC 
GA : "CACAGCA 



CGAGCCGCCC 
CCCCATCACG 
GACCAGCTCG 



Z GCCG( *C' "GCG 
Z GGGC<"GGCGG 
GGCG< iC< 'G^'G 

ttga< 'gggct 
tcgaagccga 
acggcgccga 
ag«:G' :ggacg 

TCGGACTCCC 
TT< :GGi TG'~A 

agotc< 'gcca 

GGTGC' TCGA 
AGCOGGCCGT 
ATGCO^GA'"A 
GGG'yr-'TG'JG 
T GGGC ! "AGTG 
GCGCGGOGCT 



CTGG< "CGACA 
GCGA'.'ATGAC 
GCCGGTGGCC 
CG-T'^GAOOC 
TT' "C ;CTCAT 
GT<"TOAT<.,GT 
CCAO "GAGAT 

gggo "ATCat 
cggoggc ctg 

G' "GO CG A'" AC 
G« 'GG' :aggaa 
GGTGCGCCAC 

T ( "« ;gggggga 

ACCGTCCGGT 

c^gcgcggtc 

TGGCGAGCAC 
OGG'iGAAGAA 
GTCGOGCCGG 



CCCCCGGCGG 
GCCTGCACGC 
CCGCGCTCGT 
CGCACGTCGA 
CCCAGGTCCT 
CCGACGAGGT 
CGACOTCGCC 
GCACGCACGG 
ATCCCGAGAA 
TCTACG GCG A 

togaactcgt 

CGOTGCCCAA 
GCAT'/GTCCA 
AGATCACGGC 
CC GCGAAGTG 
GTCGGGOCGG 
GG < 'GGGTCAG 
CC< tGAG»"A</C 
CT' XTC" "AGC 
GGTGAGGGGA 
GT l :ACCGGGA 
GTCGCTGGCG 

gooccg' :tcg 
toccaggtgo 



GCGGGCGGGC 

ggo ;aggcgc 
gaacccgttc 
' 'tcggcggtg 
c ac gagc at c 
gggcg'i gtag 
ggggt'1 gtcc 

OTGCTt GTAG 



3420 

3 4 80 
3S4 0 
3000 
3 660 
3720 

3 780 
38 4 0 
3900 
3960 

4 02 0 
40H0 
4 14 0 
4 2 00 
4 2 6 0 
4 320 
4 3 8 0 
4 4 4 0 
4 0 0 0 
4 0 60 
4 62 0 
4C>80 
47 4 0 
4 8i)0 
4 8 60 
4^2 0 
4 8 0 

0 0 4 0 

01 0 0 
9i 60 
r > 2 2 0 
0 2 H 0 
CCCAGCGCGT CCAGTGCGGT CAGCACCGAG TCGCGGGTGC CGGTGGCCAC ACCGCGCGCA 5340

CGGTTOAGGA OGCGGGTGAG CGTGGCCTTG CTGACGCCCG CCCGGGCTGC GATGTGGGOG 5400

AGOGG' "ATGG TCATGGCAAC GOAGTGTACG TGTCGGGGCG TCAGGGCGTG CCCACCGCGC 5460

gcggaaccgg cgga<:tg>.:gg ggcacggccc gtccggcgcc gagggaggag gcgggggaaa 5520

CGATGGGTGA AAATGCTTGC AGGAAATTGG CGCAACGTCT TTCGGCGGCT TTTCGATCCT 5580

GTTAGGTTGG TGGGAACCCC GGCGCCGCGC AGAAGGGGTT GGCGTGAGGC GTGGAGAOOT 5640

CGGCOOGATT CCGGGATCAC TCAGGGGAGT TCACAATGCG GCGTGGCATT GCGGCCACGG 5700

CGCTGTTCGG GGGTGTGGGG ATGACGGCAT CGGGGTGTGG CGGGGGCGAC AACGGCGGAA 5760

ggggtagoga cgggggcggc ACGGAGCTGT CGGGGACCGT CACCTTCTGG GACACGTCCA 5820

Arc;AAi ;< vr;/\ n A AGGOGAOG TACOAGGGOO TGGGGGAGGG GTTGGAGAAG GAGGAOGGGA 5880

AGGTGGAGGT CAAGTACGTC AACGTCCCGT TCGGCGAGGC GAACGCCAAG TTGAAGAAGG 5940

gggcgggggg caagtccggt gccccggagg tgatgggtag ggaggtcgcc tgggtggggg 6000

AOTTGGGGAG OATOGGCTAC CTCGCCCCGC TGGAGGGGAC GCCCGCCCTC GAGGAGGGGT 6060

GGGAOGAOO'F TOGGGAGGGC GGCAGCACCA GGTACGAGGG gaagagctag gcggtggggg 6120

AGGTGATGGA GA^'GTGGOG OTGTTGTAOA ACAAGGAACT GCTGACGAAG GCGGGTGTOG 6180

AGGTGGGGGG GTOCCTCGCC GAGCTGAAGA CGGCCGGCGC CGAGATGAGC GAGAAGAGGG 6240

gggcgagggg ggtgtagtgg ggggcgacga cccgtaottg gttcctgccc taggtgtagg 6300

gggagggogg ggaggtggtg gaggagaaga agaagagggt cacggtcgac gaggaagggg 6360

GTGTGOGOGG GTAOGGGGTG atgaaggagc TGGTGGAGAG GAAGGGGGGG ATCACCGACG 6420

GGTCGGAOGG CTGGAACAAC ATGGAGAAGG CCTTCAAGTC GGGCAAGGTC GCGATGATGG 6480

TCAACGGGCG GTGGGCCATC GAGGACGTCA AGGCGGGAGC CCGCTTCAAG GACGCCGGGA 6540

ACCTGGGGGT OGOOGCOGTO CCGGCGGGGA GTGGGGGAGA GGGCTCTCCC CAGGGCGGGT 6600

GGAACCTCTC GGTGTAGGCG GGGTCGAAGA ACCTCGACGC CTCCTACGCC TTCGTGAAGT 6660

AGATGAGOTG OGGGAAGGTG GAGCAGGAGA GGAGCGAGAA GCTGAGCGTG CTGCGCACGO 6720

GGAGGTGOGT GTAGGAGGTG CCGTCCGTGG CGGACAACGA GATGGTGAAG TTCTTCAAGC 6780

CGGCGGTGGA OAAGGCCGTC GAACGGGCGT GGATCGGGGA GGGCAATGGC CTCTTCGAGG 6840

CGATCCGGCT GGAG 6854

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 240 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Val Ile Val Ala Glu His Leu Val Lys Glu Phe Arg Leu Ala Glu Arg
1 5 10 15

Glu Pro Gly Leu Leu Gly Ser Leu Ser Thr Leu Leu Thr Arg Arg Tyr
20 25 30

Arg Val Val Arg Ala Val Asp Asp Val Ser Phe Glu Ile Pro Ala Gly
35 40 45

Thr Lys Thr Ala Tyr Ile Gly Ala Asn Gly Ala Gly Lys Ser Thr Thr
50 55 60

Ile Lys Met Leu Thr Gly Ile Met Thr Pro Thr Ser Gly Arg Cys Thr
65 70 75 80

Val Ala Gly Leu Glu Pro Tyr Arg His Arg Gln Arg Asn Ala Arg Thr
85 90 95

Ile Gly Val Val Phe Gly Gln Arg Ser Gln Leu Trp Trp Asp Leu Ser
100 105 110

Val Pro Asp Ser Phe Arg Ile Leu Arg Arg Ile Tyr Asp Ile Pro Gly
115 120 125

Pro Val Tyr Arg Arg Asn Leu Ala Leu Phe Arg Asp Leu Leu Asp Ile
130 135 140

Asp Ala Leu Gly Ser Thr Pro Val Arg Gln Leu Ser Leu Gly Gln Arg
145 150 155 160

Met Arg Ala Glu Ile Ala Ala Ser Leu Leu His Asp Pro Ala Val Leu
165 170 175

Phe Trp Asp Glu Pro Thr Ile Gly Leu Asp Met Val Leu Lys Asp Ala
180 185 190

Val Arg Arg Leu Val Asn Arg Ala His Arg Glu Leu Gly Thr Thr Val
195 200 205

Val Leu Thr Ser His Asp Ile Ala Asp Ile Ala Ala Ile Cys Asp Ser
210 215 220

Ala Leu Val Val Asp Gln Gly Arg Val Val His Gln Gly Thr Leu Gln
225 230 235 240



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 429 amino acids

(B) TYPE: amino acid
(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Thr Gly Leu Arg Gln Thr Gln His Leu Ala Arg Glu Ala Arg His
1 5 10 15 

Leu Ala Pro Gly Ala Ser Glu Glu Ala Val His Gly Arg Arg Val Phe
20 25 30

Ala Glu Gly Arg Gly Pro Val Leu Thr Asp Leu Asp Gly Asn Gln Tyr
35 40 45

Leu Asp Phe Ala Ala Gly Thr Leu Thr Gln Ser Leu Gly His Gly His
50 55 60

Pro Glu Val Val Glu Ala Leu Thr Thr Gln Ala Arg Arg Leu Trp Asn
65 70 75 80

Val His Asp Ser Ala Thr Pro Asp Arg Ala Gly Leu Leu Glu Leu Leu
85 90 95

Ala Arg Leu Leu Pro Glu Gln Leu Asp Thr Tyr Ala Phe Phe Ser Thr
100 105 110

Gly Ala Glu Val Val Glu Ala Ala Leu Arg Val Val Gln Ala Thr Ala
115 120 125

Ala Pro Gly Arg Asn Arg Ile Cys Ala Leu Arg His Gly Phe His Gly
130 135 140

Lys Thr Met Gly Ala Arg Met Leu Val His Trp Asp Ile Gly His Gln
145 150 155 160

Ala Phe Ser Gly Asn Ser Val Leu Ala Thr Ala Pro Thr Gly Tyr Arg
165 170 175

Cys Pro Leu Gly Leu Glu Tyr Pro Ser Cys Asp Val Arg Cys Ala Thr
180 185 190

Leu Val Arg Arg His Ile Ala Glu Lys Pro Asn Val Ser Ala Leu Val
195 200 205

Phe Glu Pro Val Leu Gly Ala Ala Gly Val Ile Val Pro Pro Pro Gly
210 215 220

Tyr Trp Glu Arg Ile Ala Gly Ala Cys Arg Asp Gly Gly Val Leu Leu
225 230 235 240

Val Ala Asp Glu Val Leu Thr Gly Gly Gly Arg Thr Gly Ala Phe Leu
245 250 255

Ala Ser Glu Leu Phe Gly Ile Glu Pro Asp Leu Ala Met Leu Ser Lys
260 265 270

Gly Thr Ala Ser Gly Phe Pro Phe Ala Val Leu Ala Gly Arg Ala Glu
275 280 285

Ala Ala Gln Ala Gly Gly Gly His Pro Gly Ala Tyr Ala Ser Thr Tyr
290 295 300

Ala Ser Asn Pro Leu Gly Ile Ala Ala Ala Arg Ala Thr Leu Glu Val
305 310 315 320

Val Glu Arg Asp Arg Leu Ile Asp Arg Val Arg Val Leu Gly Glu Leu
325 330 335

Ile Gln Glu Arg Leu Arg Ala Leu Glu Ser Arg Phe Pro Gln Leu Gly
340 345 350

Gln Val Arg Gly Leu Gly Leu Leu Trp Gly Leu Glu Phe Val Thr Asp
355 360 365

Ala Val Gly Arg Ala Pro Ala Pro Glu Thr Ala Arg Ala Val Tyr Thr
370 375 380

Thr Ala Leu Asp Leu Gly Leu Arg Thr Ser Leu Gly Gly His Ile Leu
385 390 395 400

Arg Leu Ala Pro Pro Phe Thr Leu Asp Glu Ala Leu Leu Asp Glu Gly
405 410 415

Leu Arg Leu Leu Glu Thr Ala Val Glu Arg Val Ile Ala
420 425

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 355 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Lys Ala Leu Val Leu Ala Gly Gly Thr Gly Ser Arg Leu Arg Pro 
1 5 10 15

Phe Thr His Thr Ala Ala Lys Gln Leu Leu Pro Ile Ala Asn Lys Pro
20 25 30

Val Leu Phe Tyr Ala Leu Glu Ser Leu Ala Ala Ala Gly Val Arg Glu
35 40 45

Ala Gly Val Val Val Gly Ala Tyr Gly Arg Glu Ile Arg Glu Leu Thr
50 55 60

Gly Asp Gly Thr Ala Phe Gly Leu Arg Ile Thr Tyr Leu His Gln Pro
65 70 75 80

Arg Pro Leu Gly Leu Ala His Ala Val Arg Ile Ala Arg Gly Phe Leu
85 90 95

Gly Asp Asp Asp Phe Leu Leu Tyr Leu Gly Asp Asn Tyr Leu Pro Gln
100 105 110

Gly Val Thr Asp Phe Ala Arg Gln Ser Ala Ala Asp Pro Ala Ala Ala
115 120 125

Arg Leu Leu Leu Thr Pro Val Ala Asp Pro Ser Ala Phe Gly Val Ala
130 135 140

Glu Val Asp Ala Asp Gly Asn Val Leu Arg Leu Glu Glu Lys Pro Asp
145 150 155 160

Val Pro Arg Ser Ser Leu Ala Leu Ile Gly Val Tyr Ala Phe Ser Pro
165 170 175

Ala Val His Glu Ala Val Arg Ala Ile Thr Pro Ser Ala Arg Gly Glu
180 185 190

Leu Glu Ile Thr His Ala Val Gln Trp Met Ile Asp Arg Gly Leu Arg
195 200 205

Val Arg Ala Glu Thr Thr Thr Arg Pro Trp Arg Asp Thr Gly Ser Ala
210 215 220

Glu Asp Met Leu Glu Val Asn Arg His Val Leu Asp Gly Leu Glu Gly
225 230 235 240

Arg Ile Glu Gly Lys Val Asp Ala His Ser Thr Leu Val Gly Arg Val
245 250 255

Arg Val Ala Glu Gly Ala Ile Val Arg Gly Ser His Val Val Gly Pro
260 265 270

Val Val Ile Gly Ala Gly Ala Val Val Ser Asn Ser Ser Val Gly Pro
275 280 285

Tyr Thr Ser Ile Gly Glu Asp Cys Arg Val Glu Asp Ser Ala Ile Glu
290 295 300

Tyr Ser Val Leu Leu Arg Gly Ala Gln Val Glu Gly Ala Ser Arg Ile
305 310 315 320

Glu Ala Ser Leu Ile Gly Arg Gly Ala Val Val Gly Pro Ala Pro Arg
325 330 335

Leu Pro Gln Ala His Arg Leu Val Ile Gly Asp His Ser Lys Val Tyr
340 345 350

Leu Thr Pro
355

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Thr Thr Thr Ile Leu Val Thr Gly Gly Ala Gly Phe Ile Arg Ser
1 5 10 15

Ala Tyr Val Arg Arg Leu Leu Ser Pro Gly Ala Pro Gly Gly Val Ala
20 25 30

Val Thr Val Leu Asp Lys Leu Thr Tyr Ala Gly Ser Leu Ala Arg Leu
35 40 45

His Ala Val Arg Asp His Pro Gly Leu Thr Phe Val Gln Gly Asp Val
50 55 60

Cys Asp Thr Ala Leu Val Asp Thr Leu Ala Ala Arg His Asp Asp Ile
65 70 75 80

Val His Phe Ala Ala Glu Ser His Val Asp Arg Ser Ile Thr Asp Ser
85 90 95

Gly Ala Phe Thr Arg Thr Asn Val Leu Gly Thr Gln Val Leu Leu Asp
100 105 110

Ala Ala Leu Arg His Gly Val Arg Thr Phe Val His Val Ser Thr Asp
115 120 125

Glu Val Tyr Gly Ser Leu Pro His Gly Ala Ala Ala Glu Ser Asp Pro
130 135 140

Leu Leu Pro Thr Ser Pro Tyr Ala Ala Ser Lys Ala Ala Ser Asp Leu
145 150 155 160

Met Ala Leu Ala His His Arg Thr His Gly Leu Asp Val Arg Val Thr
165 170 175

Arg Cys Ser Asn Asn Phe Gly Pro His Gln His Pro Glu Lys Leu Ile
180 185 190

Pro Arg Phe Leu Thr Ser Leu Leu Ser Gly Gly Thr Val Pro Leu Tyr
195 200 205

Gly Asp Gly Arg His Val Arg Asp Trp Leu His Val Asp Asp His Val
210 215 220

Arg Ala Val Glu Leu Val Arg Val Ser Gly Arg Pro Gly Glu Ile Tyr
225 230 235 240

Asn Ile Gly Gly Gly Thr Ser Leu Pro Asn Leu Glu Leu Thr His Arg
245 250 255

Leu Leu Ala Leu Cys Gly Ala Gly Pro Glu Arg Ile Val His Val Glu
260 265 270

Asn Arg Lys Gly His Asp Arg Arg Tyr Ala Val Asp His Ser Lys Ile
275 280 285

Thr Ala Glu Leu Gly Tyr Arg Pro Arg Thr Asp Phe Ala Thr Ala Leu
290 295 300

Ala Asp Thr Ala Lys Trp Tyr Glu Arg His Glu Asp Trp Trp Arg Pro
305 310 315 320

Leu Leu Ala Ala Thr
325



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 345 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Thr Met Arg Leu Ala Asp Ile Ala Ala Arg Ala Gly Val Ser Lys
1 5 10 15

Ala Thr Val Ser Arg Val Leu Asn Gly Ala Arg Gly Val Ala Thr Gly
20 25 30

Thr Arg Asp Ser Val Leu Thr Ala Leu Asp Ala Leu Gly Tyr Glu Gln
35 40 45

Pro Ala Arg Gln Arg Arg Ala Glu Leu Val Gly Leu Val Thr Pro Glu
50 55 60

Leu Asp Asn Pro Phe Phe Pro Ala Leu Ala Gln Val Met Gly Gln Ala
65 70 75 80

Leu Thr Arg Gln Gly Tyr Thr Pro Val Leu Ala Thr Gln Thr Pro Gly
85 90 95

Gly Ser Thr Glu Asp Glu Leu Thr Glu Met Leu Val Asp Arg Gly Val
100 105 110

Ser Gly Ile Ile Phe Val Ser Gly Leu His Ala Asp Thr Thr Ala Glu
115 120 125

Thr Gly Arg Tyr Gly Arg Leu His Glu Arg Gln Val Pro Phe Val Leu
130 135 140

Val Asn Gly Phe Ser Pro Arg Ile Glu Ala Pro Phe Val Ser Pro Asp
145 150 155 160

Asp Arg Ala Ala Met Arg Leu Ala Val Ala His Leu Ala Glu Leu Gly
165 170 175

His Glu Arg Val Gly Leu Ala Val Gly Pro Ala Arg Phe Val Pro Val
180 185 190

Gln Arg Lys Ile Glu Gly Phe Arg Ala Gly Val Arg Glu His Leu Gly
195 200 205

Val Ser Ala Arg Glu Ser Glu Glu Leu Val Gln His Ser Leu Phe Ser
210 215 220

Leu Glu Gly Gly Gln Ala Ala Ala Ser Ala Leu Ile Asp Leu Gly Cys
225 230 235 240

Thr Ala Val Met Cys Ala Ser Asp Met Met Ala Leu Gly Ala Val Arg
245 250 255

Ala Ala Arg Arg Arg Gly Leu Thr Val Pro Gly Asp Ile Ser Val Val
260 265 270

Gly Phe Asp Asp Ser Pro Leu Met Ala Phe Thr Asp Pro Pro Leu Thr
275 280 285

Thr Ile Arg Gln Pro Val Lys Ala Met Gly Gln Val Ala Val Asp Ala
290 295 300

Leu Leu Glu Glu Met Ser Gly Thr Pro Pro Pro Arg Thr Glu Phe Val
305 310 315 320

Phe Met Pro Glu Leu Val Val Arg Gly Ser Thr Ala Ala Gly Pro Arg
325 330 335

Gly Gly Arg Arg Pro Ala His Gly Arg
340 345

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 393 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Arg Arg Gly Ile Ala Ala Thr Ala Leu Phe Ala Ala Val Ala Met
1 5 10 15

Thr Ala Ser Ala Cys Gly Gly Gly Asp Asn Gly Gly Ser Gly Thr Asp
20 25 30

Ala Gly Gly Thr Glu Leu Ser Gly Thr Val Thr Phe Trp Asp Thr Ser
35 40 45

Asn Glu Ala Glu Lys Ala Thr Tyr Gln Ala Leu Ala Glu Gly Phe Glu
50 55 60

Lys Glu His Pro Lys Val Asp Val Lys Tyr Val Asn Val Pro Phe Gly
65 70 75 80

Glu Ala Asn Ala Lys Phe Lys Asn Ala Ala Gly Gly Asn Ser Gly Ala
85 90 95

Pro Asp Val Met Arg Thr Glu Val Ala Trp Val Ala Asp Phe Ala Ser
100 105 110

Ile Gly Tyr Leu Ala Pro Leu Asp Gly Thr Pro Ala Leu Asp Asp Gly
115 120 125

Ser Asp His Leu Pro Gln Gly Gly Ser Thr Arg Tyr Glu Gly Lys Thr
130 135 140

Tyr Ala Val Pro Gln Val Ile Asp Thr Leu Ala Leu Phe Tyr Asn Lys
145 150 155 160

Glu Leu Leu Thr Lys Ala Gly Val Glu Val Pro Gly Ser Leu Ala Glu
165 170 175

Leu Lys Thr Ala Ala Ala Glu Ile Thr Glu Lys Thr Gly Ala Ser Gly
180 185 190

Leu Tyr Cys Gly Ala Thr Thr Arg Thr Trp Phe Leu Pro Tyr Leu Tyr
195 200 205

Gly Glu Gly Gly Asp Leu Val Asp Glu Lys Asn Lys Thr Val Thr Val
210 215 220

Asp Asp Glu Ala Gly Val Arg Ala Tyr Arg Val Ile Lys Asp Leu Val
225 230 235 240

Asp Ser Lys Ala Ala Ile Thr Asp Ala Ser Asp Gly Trp Asn Asn Met
245 250 255

Gln Asn Ala Phe Lys Ser Gly Lys Val Ala Met Met Val Asn Gly Pro
260 265 270

Trp Ala Ile Glu Asp Val Lys Ala Gly Ala Arg Phe Lys Asp Ala Gly
275 280 285

Asn Leu Gly Val Ala Pro Val Pro Ala Gly Ser Ala Gly Gln Gly Ser
290 295 300

Pro Gln Gly Gly Trp Asn Leu Ser Val Tyr Ala Gly Ser Lys Asn Leu
305 310 315 320

Asp Ala Ser Tyr Ala Phe Val Lys Tyr Met Ser Ser Ala Lys Val Gln
325 330 335

Gln Gln Thr Thr Glu Lys Leu Ser Leu Leu Pro Thr Arg Thr Ser Val
340 345 350

Tyr Glu Val Pro Ser Val Ala Asp Asn Glu Met Val Lys Phe Phe Lys
355 360 365

Pro Ala Val Asp Lys Ala Val Glu Arg Pro Trp Ile Ala Glu Gly Asn
370 375 380

Ala Leu Phe Glu Pro Ile Arg Leu Gln
385 390



